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■ ABSTRACT 

I , We study the behavior of infinite systems of coupled harmonic oscillators as the 

bj). time t ^ 00, and generalize the Central Limit Theorem (CLT) to show that their 

^ . reduced Wigner distributions become Gaussian under quite general conditions. This 

^ . shows that generalized coherent states tend to be produced naturally. A sufficient 

condition for this to happen is shown to be that the spectral function is analytic and 
nonlinear. For a chain of coupled oscillators, the nonlinearity requirement means 
that waves must be dispersive, so that localized wave-packets become suppressed. 
Virtually all harmonic heat-bath models in the literature satisfy this constraint, 
and we have good reason to believe that coherent states and their generalizations 
are not merely a useful analytical tool, but that nature is indeed full of them. 
Standard proofs of the CLT rely heavily on the fact that probability densities are 
non-negative. Although the CLT is generally not applicable if the densities are 
allowed to take negative values, we show that a CLT does indeed hold for a special 
class of such functions. We find that, intriguingly, nature has arranged things so 
that all Wigner functions belong to this class. 
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I. INTRODUCTION 

The phenomenon of decoherenee and the useful quantum states known as coherent states have 
been extensively stiidied quite separately, both being interesting in their own right, and the linguistic 
similarity of the names may by no more than a coincidence. Yet it is becoming increasingly clear 
that the link between decoherenee and coherent states is quite a close one see [1] (Zurek, Habib 
& Paz 1993, hereafter ZHP) and references therein. ZHP give an excellent and up-to-date discussion 
of this link, and indicate that decoherenee may indeed produce coherent state, since it is shown that 
the latter tend to be the most robust states when subjected to interactions with other systems. This 
link appears to have been first pointed out by Kiibler and Zeh [2]. In this paper, we will in a sense 
complete this justification of the use of coherent states and their generalizations, by explicitly proving 
that they are created under quite generic circumstances. 

1.1. Decoherenee 

Decoherenee refers to some of the changes in a system that are due to its interaction with its 
environment. Such effects may include suppression of off-diagonal elements in the spatial density 
matrix (which makes the system appear more "classical") and increase in entropy. Decoherenee is 
now widely recognized as a key to the relationship between the quantum and classical realms of physics 
(see [3] and references therein). Sources of decoherenee discussed in the literature include scattering 
([4 — 7] and others) and quantum gravity (for instance [8,9]), but most of the literature has focused 
on systems with quadratic Hamiltonians, typically coupled harmonic oscillators in a chain or some 
other simple configuration. One reason for this is that systems with quadratic Hamiltonians are just 
about the only quantum systems whose time evolution can be found analytically. Hence they have 
provided useful and tractable models. This is why harmonic chains will be the model of choice in the 
present paper as well. 

Before the interest in decoherenee, the main motivation for studying harmonic chains was the 
pursuit of a dynamical basis for equilibrium statistical mechanics. An excellent summary of the early 
developments in this area is given in [10]. A recent summary of subsequent work is given in [11] 
(Tegmark & Yeh 1994, hereafter TY), and [12] gives a more comprehensive review. In decoherenee 
applications, the basic calculational procedure is identical to that in the statistical mechanics applica- 
tions mentioned above: The idea is to study the time evolution of some small subset of the oscillators, 
called the system, by taking a partial trace over the rest of the oscillators, called the heat bath or 
the environment. In statistical mechanics applications, the goal is to investigate whether the system 
exhibits standard thermodynamic features such as Brownian motion and approach to thermal equi- 
librium. In decoherenee applications, the emphasis is on the behavior of the reduced density matrix 
of the system and on the extent to which certain quantum phase correlations are destroyed. 

1.2. Generalized coherent states 

For historical reasons, states whose Wigner functions [13—15] are Gaussian have been given 
many different names. The single-oscillator ground state is a Gaussian centered on the origin. When 
translated in the q and p directions in phase space, it is usually called a coherent state. When rescaled 
so that it is shortened in the q direction and elongated in the p direction (or vice versa), it is known 
as a squeezed state. When subjected to the most general linear canonical transformation (translated, 
squeezed, and rotated), it is sometimes known as a tiltedly squeezed state. When expanded, it is called 
a thermal state, and is no longer pure. The translated ground state of a many-oscillator system is 
sometimes called a multimode coherent state. And so on. Thus the most general state with a Gaussian 
Wigner function might be termed a multimode tiltedly squeezed mixed state. We will simply refer to 
all these states as generalized coherent states, or Gaussian states for short. 

As is indicated by the profusion of names for them, Gaussian states have been intensely studied 
in many areas of physics, from quantum optics to statistical mechanics. One reason for this is (just as 
with harmonic chains) analytic tractability: if a state is Gaussian at some given time, it will always 
remain Gaussian if the Hamiltonian of the system is quadratic, so it is sufficient to compute the time- 
evolution of the mean and the covariance matrix, which specify the Gaussian uniquely. Another reason 
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for their popularity is that coherent states, iiiventc;(i by Schrodinger in 1926 [16] and further developed 
by Glauber [17], have been seen as a clue to understanding the classical limit of quantum mechanics. 
This is because they, as opposed to for instance energy eigenstates, exhibit fairly "classical" behavior. 



1.3. The connection 

Another Gaussian distribution, the Maxwell-Boltzmann velocity distribution, is well-known to 
arise dynamically from the interactions of many independent particles, along the lines of the Central 
Limit Theorem. Thus, in the spirit of ZHP, a natural question to ask is whether generalized coherent 
states also tend to be produced dynamically, from interactions within many-body systems. In this 
paper, we will address this question in a case where much of the necessary mathematical machinery 
is already in place: the case where the many-body system is a harmonic chain. 

The paper is organized as follows: In Section II, we review some basic results about classical and 
quantum harmonic chains and establish some notation. In Section III, we prove the main result of 
the paper for the classical case. In Section IV, we show that the same result is true for the quantum- 
mechanical case as well. Finally, in Section V, we give a more heuristic and qualitative discussion 
of what happens for finite systems and for chains lacking translational invariance. Some necessary 
mathematical results are proven in the appendices: In Appendix B we place a constraint on the 
dispersion relationship, and in Appendix C we prove a generalization of the Central Limit Theorem 
for the case where the "probability density" can take negative values. 

II. THE GENERAL HARMONIC CHAIN 

In this section, we establish some notation and review some basic results about classical and 
quantum harmonic chains and cyclic matrices. 

As our quantum system, let us take 27V -|- 1 coupled harmonic oscillators of equal mass, labeled 
—N, —1, 0, 1, N. Denoting a point in the 2 x {2N + l)-dimensional phase space by 




and the corresponding operators by 




we can write the Hamiltonian as 

H = — p^p + — ^q^Aq, 3 
2m 2 

where the time-independent matrix A is symmetric and positive definite. Throughout this paper, we 
will use units where ni = uj^ — h — 1. The number of oscillators can be either finite or infinite, but 
we will limit ourselves to the infinite case except in Section V. 

At any given time, we will specify the (pure or mixed) state of the system by its Wigner function 
W^(z). It is well-known that since the Hamiltonian is quadratic, the equation of motion for the 
Wigner function is identical to that of the Liouville function in classical statistical mechanics and has 
the solution 

Wt{z) = Wo{U{t)-'z), (4) 
where the time-evolution matrix U is given by^ 

^(^^ -[z X) = \-A^/^smA^/H cos A^H ) ' 

^ Here and throughout this paper, the action of a function on a symmetric matrix is defined as 
the corresponding real- valued function acting on its eigenvalues: Since all symmetric matrices A can 
be diagonalized as 

A = RAR^, 
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By a Gaussian state, in n diiiiCTisions (wc will often have n < 2N + 1 further on, when dealing with 
reduced Wigner functions), we will mean a state whose Wigner function is Gaussian, i.e. is of the 
form 

W{z) = (27r)-"(det C)-^/^ 1 



--(z-/.rC-i(z-M) 



(7) 



Here the mean vector fj, and the covariance matrix C satisfy 

r M = (z), 

C = I I 

(qp"^ + pq"^) (pp"^) 



(8) 




(The symmetric ordering is necessary since q and p do not commute.) The Wigner function being 
Gaussian is equivalent to the density matrix being Gaussian in the position (or momentum) represen- 
tation. 

By a time-independent state, we will mean a state with a time-independent Wigner function (or, 
equivalently, with a time- independent density matrix). In TY it is shown that a necessary but not 
sufficient condition for a state to be time-independent is that 



(9) 



where D is some constant, symmetric, positive definite matrix that commutes with A. If the state 
is Gaussian, then this is evidently also a sufficient condition, since the Wigner function is completely 
specified by fi and C. We will assume that all states have ^ = 0. This in no way reduces the generality 
of our treatment, as the time-evolution of /i and the time-evolution of the shape of the Wigner function 
(about its center /x) are totally independent (see TY). Thus assuming /i = is much like assuming 
that the center of mass is at rest at the origin when studying the motion of a blob of jello in the 
absence of external forces. 

As is conventional, we will assume that the harmonic chain is translationally invariant. This is 
equivalent to the potential matrix A being cyclic^ , i.e. that each row is a cyclic permutation of the 
row above it: Ai+ij+i = Aij, understood (mod n) for an n x n matrix. Since A is also symmetric, 
this means that we can write Aij = a|i_j| and interpret the system as a chain of harmonic oscillators 
where the coupling between any two oscillators depends only on the separation between them. (If N 
is finite, we can interpret the system as oscillators arranged in a ring rather than a line.) Using (5), 
we can write any function of a (cyclic or non-cyclic) matrix A as 

/(^)mn = RmkRnkfjdk)- (10) 



where R is orthogonal and A = diagjdj} is diagonal and real, we can extend any mapping / on the 
real line to symmetric matrices by defining 

f{R diag{dji?^) = R diag{/(dO}^^^- (5) 

It is easy to see that this definition is consistent with power series expansions whenever the latter 
converge. For example, 

(-1)' 
(2n)! 



cosA^/^ = y^-±>-A-. 



n=0 



Such matrices are often called circulant in the mathematics literature [18]. 
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Cyclic matrices have the great advantage that they aU commute. This is becaiise they can aU be 
diagonahzed by the same matrix R, an orthogonal version of the discrete Fourier matrix. Physically, 
this means that plane waves form a complete set of solutions. If A is symmetric, positive-definite, 
cyclic and infinite-dimensional, then Eq. (10) reduces to [19] 



1 

2^ 



dOf [X^{0)] cos{m-n)e, 



(11) 



where the spectral function A^(0) is the function whose Fourier coefficients are row zero of A. The 
spectral function can be interpreted as a dispersion relationship, A being the frequency of a wave with 
wave number 9. Note that f{A) is cyclic as well, i.e. its components depend only on the distance to 
the diagonal. 

A cyclic potential frequently discussed in the literature is the nearest neighbor potential, the case 
where each mass is coupled only to a fixed spring and to its nearest neighbor: 



fc= — oo 



1 ^2 1 «2 7 - n2 



(12) 



i.e. Akk = 1 + 27^, Ak,k±i 
spectral function is 



-7^ and all other elements of A vanish. For this special case, the 



A'(^) = l + 472sin2-. 



(13) 



III. THE INFINITE CLASSICAL CHAIN 

In this section, we will investigate the circumstances under which states become Gaussian in 
classical statistical mechanics. Here the positions and momenta at time t are specified by z{t), which 
is a vector of random variables. These random variables are given by the initial random variables as 

z{t) = U{t)z{Q), 

and we wish to study the circumstances under which the probability distribution of z{t) becomes a 
multivariate Gaussian as t ^ 00. 

According to equation (6), the position of oscillator m at time t is given by the initial data as 

oc 
n= — 00 

where we have defined the random variables 

^mn{t) = XmnQniO) + i^rrmPn(O). (15) 

(The above expression is to be understood without any summation.) Using the Liapunov version of 
the Gentral Limit Theorem {e.g. [20]), we see that the distribution of qm{t) becomes Gaussian as 
t ^ 00 if the Liapunov condition 

-^^0 as t^oo (16) 

is satisfied, where we have defined 



M('=)(i) = 



E (i^-(*)i') 



-1 i/fe 



(17) 
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If wc make the physically reasonable assumption about the second and third moments of the initial 
data that (zfc(O)^)^/^ > a and (|-z;c(0)|^)^/^ < k, for some positive constants a and k, then then the 
Liapunov condition reduces to the requirement that 



(Er— rV|y-„„r-') 



1/2 



1/3 

^0 as t^oo. (18) 



(For the quantum case to be treated in the next section, the assumption of a minimum standard 
deviation follows directly from the Heisenberg uncertainty principle, if we simply assume that the 
standard deviations are bounded from above.) 
If A is cyclic, then (6) and (11) yield 

J Xmn = ^ cos[\{6)t] cos(m - n)6 dO, 

[Yrr^n = ^ m~' sm[X{e)t] cos(m - n)e dO. ^^^^ 
Now Parseval's theorem gives 

^ Xl^ + Y^^=- [cos'mt] + Xiey' sin'mt]) de, (20) 

n=-oo •'-^ 

which approaches some positive constant c as t ^ cxo. Thus the Liapunov condition is satisfied if and 
only if the numerator of Eq. (18) approaches zero as f — > oo. But 

oo / \ oo 

^ \Xmnf + \Y^nf < (sup |X„„| + \Y^^\] ^ X^^ + Y^^, (21) 
n=— oo ^ ^ ri=— oo 

SO it suffices to show that this supremum approaches zero^ In Appendix A we show that this supremum 
does indeed approach zero under quite general conditions, namely for any spectral function A that is 
analytic on the entire interval [—■ tt, tt] and in addition is non-linear. 

In conclusion, we have shown that the probability distribution of qm{t) becomes Gaussian as 
t ^ oo if the spectral function A is non-linear and analytic on [—tt, tt] and if the initial probabil- 
ity distributions of all positions and momenta are independent and have bounded second and third 
moments. The proof that Pm{t) becomes Gaussian is completely analogous. 

The assumption that all random variables are independent can be relaxed to assuming that no 
dependence exists at oscillator separations larger than some fixed integer M. More precisely [21,22], 
it is sufficient that there exists an M such that n — m > M implies that the two infinite sets 

(..., q{0)m-i,p{0)m-i, q{0)m,p{0)m) and (g(0)„,p(0)„, q{0)n+i,p{0)n+i, •••) 

are independent. Finally, our proof can readily be generalized by using the multivariate CLT [23, 24] 
to show that all finite multivariate distributions become Gaussian. 



^ This is not merely a sufficient condition but also a necessary condition for qm{t) to become 
Gaussian, since a non-zero supremum means that there is some ^mn that makes a finite contribution 
to the sum (14). This would imply that the distribution of qm{t) depends on the details of the 
distribution of ^mn and thus cannot in general be Gaussian. 
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IV. THE INFINITE QUANTUM CHAIN 

In this section wc will see that all the results of the previous section can be generalized to 
the quantum-mechanical case. Much of the mathematics remains the same, but the interpretation 
changes. The big mathematical difference is that a Wigner function can take negative values, whereas 
a classical probability distribution cannot. A generalization of the Central Limit Theorem for Wigner 
distributions is proved in Appendix C. 

By analogy with reduced density matrices, all expectation values of the n*'' oscillator can be 
calculated from the n*'' single oscillator reduced Wigner function [14, 15] 

W^^\qn,Pn)= [ W{ci,p), (22) 

J{n) 

where the integral is to be taken over all variables except x„ and p„. This is analogous to the way 
the marginal probability distribution for (a;„,p„) is calculated in classical statistical mechanics. The 
only difference is that the Wigner function can take negative values and cannot be interpreted as a 
probability distribution. In Section III, wc gave necessary and sufficient conditions for when various 
marginal distributions become Gaussian as t ^ oo. Here we will pursue the quantum analog and give 
conditions for when various reduced Wigner functions become Gaussian. 
Fourier transforming equation (22) with respect to all variables yields 

W^''\qn,Pn) = W{0, 0, qn, 0, 0,Pn, 0, 0), (23) 

i.e. the Fourier transformed Wigner function (also known as the characteristic function) with all 
variables except 5,, and p„ set equal to zero. This expression is often more \iscful than (22), as it 
contains no integrals. Fourier transforming equation (4) and using the fact that det U = 1 yields 

Wt{z) = Wo [U{tfz\ , (24) 

where denotes the transpose of U. 

Let us first assume that the oscillators are not entangled initially, so that the Wigner function 
for the initial state is completely separable, i.e. of the form 

W^o(q,p)=n^o"^(9".Pn) (25) 

n 

for some set of reduced Wigner functions Wq"'\ Substituting equation (25) into equations (23) and 
(24) yields 

miPm) Qm ~l~ ^mnPmi ^mnPm ~t~ -^mnPrn) ^ (^^) 

n 

where the matrices X, Y and Z are those defined in equation (6), and no summation is implied. Thus 
the reduced Wigner function is obtained by Fourier transforming the initial reduced Wigner functions, 
multiplying them together, rcscaling their arguments appropriately, and performing an inverse Fourier 
transform on the result. This is exactly how we would compute the probability density for a weighted 
sum of independent two-dimensional random variables, which is the classical case that we investigated 
in the previous section. The standard versions of the CLT all make heavy use of the assumption 
that probability densities are non- negative. Thus in order to show that the reduced Wigner function 
becomes Gaussian, we need a Liapunov type CLT for "random variables" whose "probability densities" 
are allowed negative values, a subject which to our knowledge has not been previously studied. Wc 
leave the full mathematical details of such a study for a future paper, but prove such a generalized 
CLT in Appendix C for the special case where all the "random variables" are identically distributed. It 
appears highly plausible that the standard Liapunov proof can be appropriately generalized employing 
similar techniques. 

In conclusion, this would show that any one-particle reduced Wigner function Wf"'\qn,Pn) be- 
comes Gaussian as t — > 00 if the spectral function A is non-linear and analytic on [— tt, tt] and if the 
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initial state's of all oscillators satisfy the condition that certain expectation valnes be bounded as 
described in Section III. Specifically, the expectation values of all linear, quadratic and cubic com- 
binations of Qn and Pn should be bounded from above by some constant independent of n. Then as 
shown in Theorem III in appendix C, the other moment constraints will be automatically satisfied 
because of the Heisenberg uncertainty relationship. 

Just as in the classical case, the assumption that no initial correlations exist between different 
oscillators can be relaxed to assuming that the joint Wigncr functions are separable for oscillator 
separations larger than some fixed integer M. The generalization to the reduced Wigner function for 
more than one particle is also completely analogous. 

Our result shows that virtually all harmonic chains treated in the literature will produce Gaussian 
states as t — > oo, since they tend to have spectral functions that are both analytic and nonlinear. 
Some well-known examples of such harmonic chains are the above-mentioned nearest neighbor model 
[25,10] and the FKM model [19]. Since the FKM model has been shown to be equivalent to the 
independent-oscillator heat bath model [12], the latter will also produce Gaussian states under quite 
general conditions. 

An interesting mathematical problem is to generalize our results to arbitrary quadratic systems, 
by giving conditions for when they produce Gaussian states. It is our belief that Gaussian states will 
be seen to be produced under quite generic circumstances, and thus are ubiquitous whenever there is 
interaction between a very large number of systems. 

In TY, it is shown that if a harmonic chain starts out with an arbitrary cyclic covariance matrix 

then 

^^(o ad) (27) 

where 

D = ^[E + A-'F]. 

If the spectral function is nonlinear and analytic as discussed above, the convergence will not merely be 
pointwise as shown in the TY, but indeed uniform. Since a Gaussian is uniquely specified by its mean 
vector /i and its covariance matrix C, we thus know not only that the harmonic chain approaches a 
Gaussian state, but also exactly which Gaussian state. As we would expect, the only information that 
is preserved about the initial data is the second moments, i.e. the covariance matrix, whereas all fine 
details of the Wigner function and all information about higher moments are lost. Note that the initial 
data enter only in the combination E + A~^F, so all information about G (initial position- momentum 
correlations) is lost as well. 

Without loss of generality, we assumed that the mean vector /i = in the above treatment. The 
effect of relaxing this assumption is discussed in TY. It is seen that whereas the covariance matrix 
still converges to the value given above (and from what we have shown, all higher central moments 
converge to the values required by Gaussianity) , the mean vector fj, does not converge towards a 
constant, but keeps oscillating forever. 



8 



V. DISCUSSION 



In this paper, wc have shown that any part of a generic harmonic chain wiU evolve into a Gaussian 
state as f — > (X). Given that the spectral function is mathematically well-behaved (analytic on the 
interval [— tt, tt]), "generic" is to be interpreted as forbidding two special cases: 

1) The spectral function is linear. 

2) Fine-tuned long-range correlations exist in the initial data. 

We will now attempt to give a more intuitive and physical interpretation of these two conditions 
(which apply for infinite chains), as well as qualitatively discuss what happens if N is large but finite. 

The gist of the CLT as we have used it is that a weighted average of infinitely many independent 
random variables approaches Gaussianity as t ^ oo if all weights become infinitesimal. Very loosely 
speaking, a sum of infinitely many infinitesimally small independent random contributions is Gaussian. 
In terms of our harmonic chains, information about the initial data must be mixed, and mixed so 
thoroughly that the state of any subsystem of the chain at t = will have only an infinitesimal 
impac;t on the state of any subsystem of the chain as t oc. Physically, what can go wrong? In 
the extreme case A (x I, which corresponds to the oscillators being completely uncoupled, there is no 
mixing of information whatsoever and the CLT fails miserably. Now one might think that as long as 
an oscillator is coupled to at least one other oscillator (and thus indirectly to an infinite number of 
oscillators through it, by translational invariance), the CLT should always apply, and Gaussians should 
be obtained for any cyclic potential matrix except Aoc I. This is false. If the spectral function is linear 
(or, dropping the analyticity requirement, if it is linear on any finite interval), then a wave-packet 
composed only of wavenumbers in this interval will simply travel down the chain without dispersing, 
retaining its initial shape forever. Thus the initial data at one point will have a non-infinitesimal 
impact on the state somewhere else, even at arbitrarily late times. This is reflected as Umn —>■ 
as t — > 00 for any fixed m and n as shown in TY, while sup^ „ \Umn\ remains bounded away from 
zero, as elements of order unity merely propagate further and further away from the diagonal, at 
a linear rate. In summary, the key is that the propagation of waves must be dispersive, i.e. the 
dispersion relationship must be non-linear. This will ensure that all localized wave packets gradually 
get destroyed. 

The second constraint, that on the initial data, is closely related to the second law of thermody- 
namics: although for most initial data, the entropy of isolated gas in a container will not decrease, 
there is a small set of rather contrived initial data for which it will, and time will appear to run 
backwards for a while. The easiest way to obtain such initial data is to let a low-entropy state evolve 
into a high-entropy state and then reverse all velocities. The situation with our harmonic chains is 
completely analogous: If an uncorrelated state is allowed to evolve, the entropy of the subsystems will 
increase as each oscillator becomes increasingly correlated with ever more distant neighbors. If we 
now replace W{c[, p) by exactly W{c[, — p) (approximately will not suffice), the system will evolve back 
into the uncorrelated (and perhaps non-Gaussian) system we started with. Apparent time-reversal 
is always caused by such long-range correlations, and since wc used a version of the CLT that bans 
such correlations, such troubles are avoided altogether. Of course, after the uncorrelated initial state 
has been obtained, new correlations begin to arise again, and the subsystems eventually approach 
Gaussianity. An interesting problem is to investigate whether, in this vein, our result can be proven 
to hold for any cyclic initial conditions whatsoever. 

The result that subsystems become Gaussian as f — > oo holds strictly only for infinite chains. So 
what happens when N is finite but very large? If the waves arc dispersive, then the discussion of finite 
N in TY can readily be extended to show that ma,Xm,n Umn will evolve as follows when N is large: 

(i) During an initial transition period whose duration is of the order of the dynamical time scale 

lUq^ , it decays from its initial value of order unity to a value of order iV~^/^. 

(ii) After that, it oscillates around this value with an oscillation amplitude of the same order. 

(iii) Since the time evolution of Umn is almost periodic, some components must return to values 
of order unity an infinite number of times. This happens approximately once every Poincare 
recurrence time. However, as shown by [26], the Poincare time scale is generally enormous 
compared to the dynamical time scale, since it tends to grow exponentially with N for systems 
of this type. 
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In a discussion of density matrices [27], Fcynman writes: "When we solve; a quantum-mechanical 
problem, what we really do is divide the universe into two parts - the system in which we are interested 
and the rest of the universe. We then usually act as if the system in which we are interested comprised 
the entire universe." In this spirit we summarize our harmonic chain result: The effect of "the rest of 
the universe" is to make our subsystem approach a generalized coherent state. Since most systems in 
the real world are coupled to their environment, this gives us even more reason to believe that nature 
is indeed full of generalized coherent states. 



The authors would like to thank Emory Bunn, Leehwa Yeh and Wojtek Zurek for useful comments 
on the manuscript. 



APPENDIX A 

In this appendix we give a condition for when sup,^ „ \ U{t)„in\ as t ^ oo. This rests on 
Theorem (I), which is proved in Appendix B. Since according to Eq. (6) and Eq. (11), 



'Xmn = ^ 11^ cos[X{9)t] cos[(m - n)e] dO, 
< Ymn =^Jl^sm[X{e)t]cos[{m-n)e]X{e)-'de, 
.Zmn = ^ sm[\{e)t] cos[(m - n)e]\{e) dO, 



we wish to show that 



sup 

k 



J — TT 



,iX(8)t -ike 



9{0)de 



as f 



where k = ±(m — n) is any integer, g{6) = \{0Y , and v = v = —1 and y = 1, respectively. Setting 
f{d) = X(0), Theorem I shows that sup„j „ \X{t)mn\ — > as i — > oo if A is a non- linear analytic 
function on the entire interval [— 7r,7r]. The same holds for Z{t)mn, i-e- the v = 1 case. Since A 
is positive definite, A is bounded from below by some positive constant, so A~^ is also analytic and 
suPm „ |y(i)mn| ^ as t ^ OO foUows Under the same conditions. In summary, sup„ „ \U{t)mn\ 
as i — *■ oo for any bounded non-linear analytic spectral function A. 

It is noteworthy that the non-linearity requirement is crucial to ensure that the convergence to 
zero is uniform, independent of m and n. By simply changing variables and using Ricmann-Lebesgue's 
Lemma, it is readily seen that U {t)mn will approach zero as t — > oo for any fixed m and n, even if A 
is linear. However, as was discussed in Section V, this alone is not suflacient for producing Gaussian 
states. 



10 



APPENDIX B 

In this Appendix, which is purely mathematical, we prove the basic convergence theorem upon 
which the conclusion of the paper rests. The theorem is of course a version of "Van der Corput's 
Lemma" [28] in the theory of oscillatory integrals, but the uniformity with respect to k (which we 
believe is new) requires quite delicate handling. 

Theorem I. Let / be a function analytic on a neighborhood of the closed bounded interval / 
of the real axis, and real-valued on I. Assume / is not a polynomial of degree < 1. Then, for any 
g e C^(J), we have 



sup 
feeR 



ikx 



g{x)dx 



as t ^ ±00. 



Actually, as the proof will show, the left-hand side is O {\t\~'') for some a > 0. 

In the proof, wc may restrict attention to f > 0, as the other case then follows if wc replace / by 
— /. By hypothesis, there is an open simply connected domain D containing / such that / is analytic 
on a neighborhood of the closure D of D. 

Lemma Bl. There is an integer I such that, for every w €: C, f {z) — w = Q has at most (. roots 
(counting multiplicities) in D. 

Proof. This is a simple exercise in complex analysis. 

Lemma B2. If F is of class C'^{J) and real- valued for some closed bounded interval J C R, 
g e C^(J), and F is strictly monotone on J, then for t > we have 



iF{x)t 



g{x)dx 



< 



where 5 denotes the smaller of 1 and min |F'(a;)| for x € J. Here Ci is a constant depending only on 
g and the number max|i^"(a;)| : x £ J. 

Proof. This is a standard estimate of "Van der Corput type" (see for instance [28]. This is 
a rather primitive version, the proof being a straightforward variable change y = F{x) followed by 
partial integration. With stronger hypotheses one can get 6 rather than 6'^ in the denominator, but 
this is not required for our purposes. 

In the following, a number of constants whose precise values are not essential will arise. Constants 
denoted C\, C2,... will all be independent of k (later u), depending only on the functions / and g and 
the geometric entities /, D. 

Proof of Theorem. We must estimate the integral 



''gix)dx = / e*^(^)*c/(a;)dx, 



where F{x) = f{x) — ux, and u = kjt is a real parameter. By Lemma Bl, the number of complex 
zeroes z to /'(z) — m = in l) is bounded by an integer I independent of u. Denote the distinct zeroes 
by Zj = Zj{u); j = l,...,s, with corresponding multiplicities mi,...,ms and '^i — ^- Now, fix 

e > and let Aj denote an open disk of radius e centered at Zj. Then, /\Uj=i ^3 consists of a union 
of r < / ± 1 pairwise disjoint closed intervals Jj, on each of which F is strictly monotone. Moreover 
we have the estimate 

\F'{x)\ > C2e^ (S3) 



for all x in these intervals. We will show this after completion of the argument. By (B2) we have for 
small £ 

e'^^^^'g{x)dx < 



Summing over i, and noting that / \ (J J, has length < 2is, we get 



I/' 



iF{x)t 



g{x)dx 
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For fixed (large) t, choose here e = f.^^/'-^''+^'> and we see that the left-hand term in (B4) is bounded 
by Cst"^/^^^"*"^-*. This concludes the proof of the Theorem. 

We now supply the proof for the estimate (B3). Let us define the polynomial 

s 

P{z) = P{z,u)^l[[z-zj{u)r^. 

It is clear that there is some constant Cq such that P{z, u) < C% for all ^; G .D and for all u. Now, 
consider the function P{z,u)/{f'{x) — u). It is analytic in D. Moreover, for some constant C7, 

ma^-g^^<Cr. (B5) 
xei \f'{x)-u\ 

(We will return to the proof of (B5) shortly.) Thus, for x e I, 

\f'{x)-u\>C^^\P{x,u)\>C2S^ 

for some constant C2 when x € I\{UAi). Thus all that remains in order to prove (B3) is to show 
that (B5) holds. This can be done as follows. Let Fi, F2,...,F^+i be pairwise disjoint simple closed 
curves in D, each of which encloses /. 

Lemma B3. There is a positive constant Cs such that for any u, 

min\f'{z)-u\>C8 

holds for at least one value of j. 

Proof. Let us define (pj{u) = min \ f'{z) — u\ : z € Tj. It is easy to see that (pj is continuous. 
Hence, so is 

(p(u) = max ipn(u). 
i<j<e+i 

Moreover, (p{u) > 0, because if ip{u) = for some u, then all ^j{u) are zero, so f'{z) — u vanishes at 
least once on each Fj and thus has at least £ +1 zeroes, a contradiction. Since ip{u) is continuous, 
positive and obviously ^00 as |m| ^ co, it attains a positive minimum value C%. We thus have that 
for every u, there is at least one j = j{u) such that ^Pj{u) < Cg, which proves the Lemma. 
By the maximum modulus theorem, for any u e C, 

max IS£l^<max.l^(^'")l 



zei \nz)-u\ - .er, |/'(^) - «!' 

where we choose j = j{u) as in Lemma B3. Thus on the right hand side, the numerator is bounded 
from above by Cf, and the denominator is bounded from below by Cg, so the entire expression is 
< Cr = Cq/Cs- This completes the proof. 
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APPENDIX C 

In this Appendix, we prove a generalized version of the Central Limit Theorem (CLT) that holds 
for Wigner Distributions. Although the CLT can not be generalized to arbitrary functions that are 
allowed to take negative values, we show that repeated convolutions does indeed lead to Gaussianity 
for a special class of such functions. We find that, for some reason, nature has arranged things so that 
all Wigner functions belong to this class. 

Given a function / on R'*, its zeroth, first and second moments are defined as 



m(°) = J f{x)d'^x, 
Mf) = J f{x)xid'^x, 
M^f = j f{x)xiXjd'^x 

if the moments exist, i.e. if these integrals are convergent in the Lebesgue sense. In probability theory, 
the second central moment matrix Vij = M-p — M-^^ Mj^^ is usually called the covariance matrix. Let 
us define a quasi-probability density on R"^ as a real-valued function / having the following properties: 

* M(o) = 1. 

* The first moments M-^^ exist. 

* The second moments exist and the covariance matrix is strictly positive definite. 

* For reasons that will become clear later, we will also make the technical assumption that / is an 

L'^ function, i.e. square-integrable. 
If / has the additional property that it is non- negative, i.e. that it is a probability density, then the 
basic version of the CLT states that if we define /„ to be / convolved with itself n times and translated 
and rescaled so as to have the same first and second moments as /, then /„ approaches a Gaussian g 
as n ^ oo. The convergence is usually shown to be in the weak topology of measures, which in our 
context means that integral of /„ times any bounded testfunction tends to the corresponding integral 
for g. We wish to investigate under which circumstances /„ approaches a Gaussian if we drop the 
assumption of non-negativity. 

Without loss of generality, we may assume that M-'^^ = and that Vij = Sij, the identity matrix, 
as the general case can be obtained from this by a simple change of variables. By Fourier transforming 
and using the convolution theorem, one then obtains the standard expression 

/„(k) = /(n-i/2k)". (CI) 

Our problem decomposes into two parts: 

(A) To give conditions for when /„(k) g(k) = e~^ ^"^ as n ^ oo. 

(B) To show that this convergence to Gaussianity on the Fourier side really implies that fn ^ 9 

in some meaningful sense. 
It is important to note that (B) is not merely an unphysical mathematical detail. This is illustrated 
by the following counterexample: Take d=l and chose f{k) to be any smooth, symmetric function 
such that /(O) = 1, /'(O) = 0, /"(O) = —1 and > 1 for some constant A;, > 0. An example of 

such a function is f{k) = [1 + k^)er^ It is easy to see that its inverse Fourier transform / will 
have all the properties of a quasi-probability density. It is also easy to show that fn{k) g{k) as 
n — > 00 pointwise, for any fixed k, since f{k) = 1 — + 0{k^) follows from our assumptions, and 

1 ^ e ^ ' as n —^ oo. 

2nJ 

Yet Eq. (CI) clearly shows that the part of the curve that exceeds unity will grow ever larger as n 
increases. Pointwise convergence is obtained merely because the growing |/„| > 1 hump keeps shifting 
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out to higher and higher frequencies k. Thus as n grows large, /„ may look quite Gaussian on the 
interval \k\ <C n^/^fc*, but there will be exponentially growing bumps of height f{k^)^ a,t k = ±n^/^A;*. 
Inverse Fourier transforming, this means /„ will behave like a sum of a Gaussian and violent noise, 
whose frequency and amplitude increase without bounds as n ^ oo. 

We will refer to a quasi-probability density as proper if the absolute value of its Fourier transform 
takes its maximum only at the origin. Thus / is proper if |/(k)| < 1, with equality only for k = 0. 
If a quasi-probability density never takes negative values (and hence is a probability density in the 
conventional sense), then it is easy to show that it will automatically be proper. The "ultraviolet 
catastrophe" described above shows that a necessary condition for a CLT to hold is that |/| never 
exceeds unity. Thus being proper is a necessary condition, except perhaps for the borderline case 
where |/(k)| < 1 but actually equals unity for some k ^ 1. In what follows, we will show that being 
proper is also a sufficient condition. We will also see that, interestingly, all Wigner quasi-probability 
densities are proper. 

In what follows, the function g will always denote the d-dimensional Gaussian 

Unless otherwise indicated, all integrals below are to be taken over all space. || • ||2 will denote the L'^ 
norm in R'', defined by 

Il/ll2=(/ |/(x)|Va; 

Lemma CI: If / is a proper quasi-probability density, then for any £ > 0, there exists a. 5 > 
such that |/(k)| < 1 - (5 for all |k| > e. 

Proof: If / is integrable, then /(k) ^ as |k| — > oo by Riemann-Lebesgue's Lemma, so the 
continuous function |/(k)| attains some maximum value on the set {k : |k| > e}. Mg < 1 since / 
is proper, so we can choose S = 1 — M^. Alternatively, if we do not wish to assume that / is integrable, 
it is straightforward to show that /(k) ^ as |k| ^ cxd if / is any Wigner function. 

Lemma C2: The norms ||/n||2 are bounded by a constant independent of n. 

Proof: For small k, / has the asymptotic behavior /(k) = 1 — + o(fc^). Thus it is easy to 
see that given any constant p < 1, there exists an Sp such that 

for all |k| < Ep. For all other k, we have 

\mf<{i-6pr 

for some 5p > hy Lemma CI. Combining these two bounds, we obtain 

ll/n||2 = / \fn{^)\^d''k < [ e-P^'d^k + (1 - <5p)2("-i) / |/V/'k)|2rf<^fc. 

J ^|k|<Vnep J\]^\>^/nSp 

Extending both integrals to all of space and changing variables in the second one, we get 

ll/nlli < We-P'^/'Wl+n^'/'il-Spf^"-'^ ml (C2) 

Since the last term — > as n — *■ oo, the left hand side is bounded by a constant independent of n. 
Lemma C3: /n(k) g(k) pointwise as n ^ oo. 

Proof: This step is identical to that in proofs of the classical CLT (see for instance [20]), so we 
omit it. 

Lemma C4: /„ — > ^ in weak L'^ topology as n — > oo. 




14 



Proof: By a standard result in functional analysis [29]. weak convergence (that /„ — (} inte- 
grated against any test function approaches zero) follows from the pointwise convergence (Lemma 
C3) and bounded norms (Lemma C2). 

Lemma C5: ||/„||2 ^ Hfflb as n ^ oo. 

Proof: Letting p t 1 in Eq. (C2) and invoking Fatou's Lemma, 

limsup||/„||2 < llfflh- 

n^oo 

But since /y^ ^ ^ in weak topology, we have 

||5||2<liminf||/„||2. 

n— ^oo 

The two preceding inequahties imply hmsup ||/,i||2 < Hminf ||/,i||2- Since the reverse is true always, 
liminf = limsup, which implies that lim ||/„||2 exists and equals ||5||2- 
Theorem II: If / is a proper quasi-probability density, then 



\fn{-x) - g{'x.)\ drx as n ^ oo, 

i.e. fn approaches a Gaussian in norm. 

Proof: Because of the Plancherel Theorem (L^-unitarity of the Fourier transform), this is equiv- 
alent to 

||/n-5||2^0 as n^oo, 

i.e. that /„ — > 5 in strong topology. But by a standard functional analysis result, this follows from 
weak convergence (Lemma C4) combined with convergence of the norm (Lemma C5), so the proof 
is complete. 

Thus we have shown that /„ approaches Gaussianity in the strong LP' sense. Note that in Lemma 
1, we used the technical assumption that / was either integrable or a Wigner function. If we wish to 
make the additional technical assumption that not only is / (and hence /) in but, for some e > 
(however small) |/p|k|'^ is also integrable over R"^, then we can show the following: /„ converges not 
merely in strong but also in strong L^, and consequently /„ converges uniformly to a Gaussian. 
Thus 

sup |/„(x) — g(x)| — » as n ^ 00, 

X 

which rules out a number of physically uninteresting pathological cases, such as /n(x) converging to 
the Gaussian g'(x) for all x except for a set of measure zero. 

Lemma C6: If W is a Wigner function, then W{2.) < W{Q) = 1, where the inequality is strict 
if z ^ 0. 

Proof: For a normalized wavefunction ^ in n dimensions (a pure state), the Fourier transform 
of the Wigner function is 

l?(k, x) = y" e-^(''i+" P%(q, p) (Tp = J e-* 'iV'(q - x/2)* V(q + x/2) 

where the integral is to be taken over all space. Thus 

|W^(k,x)|< J |V(q-x/2)||V(q + x/2)|d"g. 

Using the trivial inequality AB < {A^ + B^)/2 (with strict inequality unless A = B), we obtain 

|tF(k,x)| < |^(q-x/2)|2rf"g+i J |V(q + x/2)|2d"g 
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The case of a mixed state, whcac; the; Wigncir function is a weighted average of Wigner functions of 
pure states, foUows directly from superposition. 

That we have strict inequality for z ^ is seen as follows: The second inequality above is an 
equality only if '4>{ci — x/2) = + almost everywhere, i.e. if ip has period x. But since 
/ tp{q)d"-q = 1, tp cannot be periodic, and the only possibility is x = 0. Thus setting x = in the 
first inequality and subtracting unity from both sides shows that we have equality only if 

J [l-cos(k.q)] |V(q)|'rf> = 0. 

Since the integrand is non-negative, it must vanish identically. Since ||-!/'||2 = 1 > 0, cannot vanish 
almost everywhere, and the only possibility is k = 0. 

Theorem III: The Gaussian result in Theorem (II) is always obtained if / is a Wigner function 
with finite first and second moments. 

Proof: Let /(z) = W{z) = W{p, q) be a Wigner-function in n dimensions, i.e. take d = 2n. We 
only need to check that all the conditions of Theorem (II) hold, i.e. that all such Wigner functions 
are indeed proper quasi-probability densities. All Wigner functions integrate to unity and are square- 
integrable (indeed ||H^||2 < (27r?i)~"/^, with equality only for pure states [14]). Lemma C6 showed 
that all Wigner functions are indeed proper. Thus all that remains to be shown is that the covariance 
matrix V — C \s strictly positive definite. This is a well-know fact, basically a corollary to the 
multidimensional micertainty principle uncertainty, but we give a brief proof here for completeness. 

Without loss of generality, we may assume that the Wigner function corresponds to a pure state, 
since the covariance matrix of a mixed state is simply the weighted average of the covariance matrices 
in the mixture, and the weighted average of positive definite matrices is always positive definite. That 
V is positive semidefinite follows immediately from the fact that Wigner functions are proper: 

|W(z)|2 = l-F,.fcz,zfc + 0(|z|3), 

where j and k are to be summed over from 1 to d, so if V would have a negative eigenvalue, then there 
must exist a point near the origin where \W\ > 1, a contradiction. The multidimensional uncertainty 
relationship [30, 31] states that 

detF > (?i/2)2" 

for all Wigner functions, with equality only for Gaussian pure states, so none of the eigenvalues of V 
can vanish, and V must be positive definite. Thus all Wigner functions are proper quasi-probability 
densities, and the proof is complete. 

Note that in contrast to the case of positive densities, second moments can vanish not merely in 
pathological cases, but also for well-behaved functions. Such an example is 

W{z) = e-'\ 

for which V = 0. Also note that the requirement that the first and second moments be finite is 
necessary for the classical CLT as well. Finite first and second moments with respect to momentum 
is equivalent to the kinetic energy being finite. Finite first and second spatial q-moments can be 
interpreted as the system being spatially localized. Indeed, if the Hamiltonian is quadratic and 
positive definite (as it was in all cases treated in this paper), then all first and second moments must 
be finite if the total energy of the system is finite. 
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